# Metadata for the PRIVEE dataset

## Description
This dataset lists all the open datasets we have collected through the Socrata API and is used in developing the PRIVEE interface. We have enriched each entry with metadata about the corresponding dataset, including its columns, tags, and number of rows. We have also identified some of the quasi-identifiers present in these datasets.

## Columns in this dataset
- Dataset
	- Description: name of the dataset
	- Format: string
- Domain
	- Description: data portal where the dataset is available
	- Format: string
- Permalink
	- Description: permanent link to the dataset
	- Format: string/URL
- Columns Name
	- Description: all the columns of the dataset
	- Format: array
- Columns Field Name
	- Description: unique versions of all the column names, written without spaces
	- Format: array
- Quasi Identifier Present
	- Description: columns corresponding to the quasi-identifiers age, gender, race, and location, if any are present
	- Format: dictionary
- Number of Rows
	- Description: number of rows present in the dataset
	- Format: number
- Tags
	- Description: domain tags supplied by the data portal
	- Format: array
	- Notes: the array may be empty for some datasets
- Type
	- Description: record granularity of the dataset (individual/aggregated)
	- Format: string
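
The column layout above can be used to pick out datasets that carry particular quasi-identifiers. The following is a minimal sketch, not part of the PRIVEE code: it assumes each row has already been loaded into a Python dictionary keyed by the column names above, and that `Quasi Identifier Present` maps a quasi-identifier name (such as `age`) to the matching column. The sample records and the helper name `select_by_quasi_identifiers` are hypothetical and only illustrate the shape of the data.

```python
from typing import Any, Dict, Iterable, List, Optional, Set

Record = Dict[str, Any]

# Hypothetical records following the column layout above. The values are
# invented for illustration and are not taken from the real dataset.
records: List[Record] = [
    {
        "Dataset": "Example Health Survey",
        "Domain": "data.example.gov",
        "Permalink": "https://data.example.gov/d/abcd-1234",
        "Columns Name": ["Age Group", "Sex", "Zip Code"],
        "Columns Field Name": ["age_group", "sex", "zip_code"],
        # Assumed shape: quasi-identifier name -> corresponding column.
        "Quasi Identifier Present": {
            "age": "age_group",
            "gender": "sex",
            "location": "zip_code",
        },
        "Number of Rows": 1200,
        "Tags": ["health"],
        "Type": "individual",
    },
    {
        "Dataset": "Example Census Summary",
        "Domain": "data.example.gov",
        "Permalink": "https://data.example.gov/d/efgh-5678",
        "Columns Name": ["Age Bracket", "Count"],
        "Columns Field Name": ["age_bracket", "count"],
        "Quasi Identifier Present": {"age": "age_bracket"},
        "Number of Rows": 18,
        "Tags": [],  # Tags may be empty
        "Type": "aggregated",
    },
]


def select_by_quasi_identifiers(
    rows: Iterable[Record],
    required: Set[str],
    record_type: Optional[str] = None,
) -> List[str]:
    """Return the names of datasets that contain every required quasi-identifier.

    If record_type is given, only rows whose "Type" equals it are kept.
    """
    selected = []
    for row in rows:
        present = row.get("Quasi Identifier Present") or {}
        # Count a quasi-identifier only if it maps to a non-empty column.
        found = {name for name, column in present.items() if column}
        if not required <= found:
            continue
        if record_type is not None and row.get("Type") != record_type:
            continue
        selected.append(row["Dataset"])
    return selected


both = select_by_quasi_identifiers(records, {"age", "location"})
aggregated = select_by_quasi_identifiers(records, {"age"}, record_type="aggregated")
```

How the array and dictionary columns are serialized in the published file is not stated here, so a loading step (for example, decoding those cells from strings) may be needed before a filter like this can run.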

## Meta
- First downloaded on: 20th July, 2021
- Subsequent enrichment and updates: July 2021 - May 2022
- Published on: 7th August, 2022